Investigating the Contribution of Distributional Semantic Information for Dialogue Act Classification
Authors
Abstract
This paper presents a series of experiments in applying compositional distributional semantic models to dialogue act classification. In contrast to the widely used bag-of-words approach, we build the meaning of an utterance from its parts by composing the distributional word vectors using vector addition and multiplication. We investigate the contribution of word sequence, dialogue act sequence, and distributional information to performance, and compare our results with current state-of-the-art approaches. Our experiments suggest that distributional information is useful for dialogue act tagging but that simple models of compositionality fail to capture crucial information from word and utterance sequence; more advanced approaches (e.g. sequence- or grammar-driven, such as categorical, word vector composition) are required.
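The two composition operations named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy vectors, the function names `compose_additive` and `compose_multiplicative`, and the choice to skip unknown words are all assumptions made for illustration.

```python
from functools import reduce
from operator import mul

# Hypothetical 3-dimensional word vectors; the values are invented for
# illustration and do not come from any real distributional model.
TOY_VECTORS = {
    "can": [1.0, 2.0, 0.5],
    "you": [0.5, 1.0, 2.0],
    "help": [2.0, 0.5, 1.0],
}


def _known_vectors(tokens, vectors):
    """Look up each token, skipping out-of-vocabulary words (an assumption)."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        raise ValueError("no token of the utterance has a vector")
    return known


def compose_additive(tokens, vectors):
    """Utterance vector as the component-wise sum of its word vectors."""
    return [sum(dims) for dims in zip(*_known_vectors(tokens, vectors))]


def compose_multiplicative(tokens, vectors):
    """Utterance vector as the component-wise product of its word vectors."""
    return [reduce(mul, dims, 1.0) for dims in zip(*_known_vectors(tokens, vectors))]


utterance = ["can", "you", "help"]
additive = compose_additive(utterance, TOY_VECTORS)
multiplicative = compose_multiplicative(utterance, TOY_VECTORS)

# Both operations are commutative, so reordering the words gives the same
# utterance vector.
reordered = ["help", "you", "can"]
same_when_reordered = (
    compose_additive(reordered, TOY_VECTORS) == additive
    and compose_multiplicative(reordered, TOY_VECTORS) == multiplicative
)
```

Because both operations ignore word order, "can you help" and "help you can" receive identical vectors in this sketch, which is consistent with the abstract's observation that simple composition models miss information carried by word sequence.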
Similar resources
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays a key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researchers to use...
Semantic Features for Dialogue Act Recognition
Dialogue act recognition commonly relies on lexical, syntactic, prosodic and/or dialogue history based features. However, few approaches exploit semantic information. The main goal of this paper is thus to propose semantic features and integrate them into a dialogue act recognition task to improve the recognition score. Three different feature computation approaches are proposed, evaluated and ...
The Distribution of Mood: An Exploration of Distributional Compositions in Sentiment Classification
Distributional semantics is a research area investigating unsupervised data-driven models for quantifying semantic relatedness. This thesis investigates the possibilities of using distributional semantic models for sentiment classification of utterances, by composing distributional vectors of words in utterances. For evaluation I use a set of manually classified movie reviews. While the purpose ...
Understanding questions and finding answers: semantic relation annotation to compute the Expected Answer Type
The paper presents an annotation scheme for semantic relations developed and used for question classification and answer extraction in an interactive dialogue-based quiz game. The information that forms the content of this game concerns biographical facts about famous people's lives and is often available as unstructured text on the internet, e.g. in the Wikipedia collection. Questions asked as wel...
Automatic Utterance Segmentation in Instant Messaging Dialogue
Instant Messaging (IM) chat sessions are real-time, text-based conversations which can be analyzed using dialogue-act models. Dialogue acts represent the semantic information of an utterance; however, messages must be segmented into utterances before classification can take place. We describe and compare two statistical methods for automatic utterance segmentation and dialogue-act classificatio...